NSF PAR Search | NSF Public Access Repository

Randomized Algorithms for Symmetric Nonnegative Matrix Factorization

https://doi.org/10.1137/24M1638355

Hayashi, Koby; Aksoy, Sinan G; Ballard, Grey; Park, Haesun (March 2025, SIAM Journal on Matrix Analysis and Applications)

Symmetric Nonnegative Matrix Factorization (SymNMF) is a technique in data analysis and machine learning that approximates a symmetric matrix with a product of a nonnegative, low-rank matrix and its transpose. To design faster and more scalable algorithms for SymNMF, we develop two randomized algorithms for its computation. The first algorithm uses randomized matrix sketching to compute an initial low-rank approximation to the input matrix and proceeds to rapidly compute a SymNMF of the approximation. The second algorithm uses randomized leverage score sampling to approximately solve constrained least squares problems. Many successful methods for SymNMF rely on (approximately) solving sequences of constrained least squares problems. We prove theoretically that leverage score sampling can approximately solve nonnegative least squares problems to a chosen accuracy with high probability. Additionally, we prove sampling complexity results for previously proposed hybrid sampling techniques which deterministically include high leverage score rows. This hybrid scheme is crucial for obtaining speedups in practice. Finally, we demonstrate that both methods work well in practice by applying them to graph clustering tasks on large real world data sets. These experiments show that our methods approximately maintain solution quality and achieve significant speedups for both large dense and large sparse problems.
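To make the hybrid leverage score sampling described in this abstract more concrete, here is a minimal Python sketch. It is not the authors' implementation. It assumes a small dense matrix stored as a list of rows with full column rank, and the function names (`leverage_scores`, `hybrid_sample`, `sketch_rows`), the `threshold` parameter, and the example data are all hypothetical. The sketch computes row leverage scores from an orthonormal basis of the column space, keeps high-score rows deterministically, samples the remaining rows in proportion to their scores, and builds the smaller weighted system to which a nonnegative least squares solver would then be applied.

```python
import math
import random


def leverage_scores(A):
    """Row leverage scores of a tall, full-column-rank matrix A (list of rows).

    Modified Gram-Schmidt builds an orthonormal basis Q for the column space
    of A; the leverage score of row i is the squared norm of row i of Q.
    """
    m, n = len(A), len(A[0])
    cols = [[A[i][j] for i in range(m)] for j in range(n)]
    q_cols = []
    for col in cols:
        v = col[:]
        for q in q_cols:
            proj = sum(qi * vi for qi, vi in zip(q, v))
            v = [vi - proj * qi for qi, vi in zip(q, v)]
        norm = math.sqrt(sum(vi * vi for vi in v))
        if norm < 1e-12:
            raise ValueError("A must have full column rank")
        q_cols.append([vi / norm for vi in v])
    return [sum(q[i] ** 2 for q in q_cols) for i in range(m)]


def hybrid_sample(scores, num_random, threshold, rng):
    """Return (row_index, weight) pairs for a hybrid row sample.

    Rows with score >= threshold (a hypothetical cutoff) are always kept with
    weight 1. The other rows are drawn with replacement, with probability
    proportional to their scores, and weighted by 1/sqrt(num_random * p_i).
    """
    keep = [i for i, s in enumerate(scores) if s >= threshold]
    rest = [i for i, s in enumerate(scores) if s < threshold]
    picked = [(i, 1.0) for i in keep]
    total = sum(scores[i] for i in rest)
    if rest and total > 0 and num_random > 0:
        probs = [scores[i] / total for i in rest]
        draws = rng.choices(range(len(rest)), weights=probs, k=num_random)
        for d in draws:
            picked.append((rest[d], 1.0 / math.sqrt(num_random * probs[d])))
    return picked


def sketch_rows(A, b, picked):
    """Form the sampled, reweighted system (SA, Sb) from the picked rows."""
    SA = [[w * a for a in A[i]] for i, w in picked]
    Sb = [w * b[i] for i, w in picked]
    return SA, Sb


# Illustrative data: row 0 dominates the first column, so its score is near 1.
A = [[10.0, 0.0], [0.1, 1.0], [0.2, 1.0], [0.1, 0.9], [0.0, 1.1], [0.1, 1.0]]
b = [1.0, 2.0, 2.0, 1.8, 2.2, 2.0]
scores = leverage_scores(A)
picked = hybrid_sample(scores, num_random=3, threshold=0.9, rng=random.Random(0))
SA, Sb = sketch_rows(A, b, picked)
print(len(SA), "sampled rows out of", len(A))
```

The reduced system `(SA, Sb)` would replace `(A, b)` in each constrained least squares solve. The Gram-Schmidt step is used here only to keep the sketch dependency-free; a QR or SVD routine from a numerical library would be the usual choice for larger inputs.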
Free, publicly-accessible full text available March 31, 2026
Retrieving Top-k Hyperedge Triplets: Models and Applications

https://doi.org/10.1109/BigData62323.2024.10825860

Niu, Jason; Amburg, Ilya D; Aksoy, Sinan G; Sarıyüce, Ahmet Erdem (December 2024, Proceedings)

Complex systems frequently exhibit multi-way, rather than pairwise, interactions. These group interactions cannot be faithfully modeled as collections of pairwise interactions using graphs and instead require hypergraphs. However, methods that analyze hypergraphs directly, rather than via lossy graph reductions, remain limited. Hypergraph motifs hold promise in this regard, as motif patterns serve as building blocks for larger group interactions which are inexpressible by graphs. Recent work has focused on categorizing and counting hypergraph motifs based on the existence of nodes in hyperedge intersection regions. Here, we argue that the relative sizes of hyperedge intersections within motifs contain varied and valuable information. We propose a suite of efficient algorithms for finding top-k triplets of hyperedges based on optimizing the sizes of these intersection patterns. This formulation uncovers interesting local patterns of interaction, finding hyperedge triplets that either (1) are the least similar with each other, (2) have the highest pairwise but not groupwise correlation, or (3) are the most similar with each other. We formalize this as a combinatorial optimization problem and design efficient algorithms based on filtering hyperedges. Our comprehensive experimental evaluation shows that the resulting hyperedge triplets yield insightful information on real-world hypergraphs. Our approach is also orders of magnitude faster than a naive baseline implementation.
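The naive baseline mentioned in this abstract can be pictured with a short Python sketch that enumerates every hyperedge triplet and keeps the k best under a chosen score. This is only an illustration under stated assumptions: the two scoring functions below are hypothetical stand-ins for "most similar" and "highest pairwise but not groupwise" overlap, not the objectives defined in the paper, and the names `top_k_triplets`, `score_most_similar`, and `score_pairwise_not_groupwise` are invented here.

```python
from heapq import nlargest
from itertools import combinations


def score_most_similar(a, b, c):
    # Hypothetical score: number of nodes shared by all three hyperedges.
    return len(a & b & c)


def score_pairwise_not_groupwise(a, b, c):
    # Hypothetical score: smallest pairwise overlap once the nodes common to
    # all three hyperedges are excluded.
    common = len(a & b & c)
    return min(len(a & b), len(a & c), len(b & c)) - common


def top_k_triplets(hyperedges, k, score):
    """Brute-force top-k search over all triplets of hyperedges.

    `hyperedges` maps a hyperedge name to its set of nodes. The cost grows
    cubically with the number of hyperedges, so this is only a baseline.
    """
    scored = (
        (score(hyperedges[x], hyperedges[y], hyperedges[z]), (x, y, z))
        for x, y, z in combinations(sorted(hyperedges), 3)
    )
    return nlargest(k, scored)


# Illustrative hypergraph with four hyperedges.
hyperedges = {
    "e1": {1, 2, 3, 4},
    "e2": {1, 2, 3, 5},
    "e3": {1, 2, 3, 6},
    "e4": {4, 5, 7},
}
print(top_k_triplets(hyperedges, 1, score_most_similar))
print(top_k_triplets(hyperedges, 1, score_pairwise_not_groupwise))
```

Because every triplet is scored, the running time is cubic in the number of hyperedges, which is the kind of cost that filtering hyperedges before enumeration is meant to avoid.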
Full Text Available
High-order Line Graphs of Non-uniform Hypergraphs: Algorithms, Applications, and Experimental Analysis

https://doi.org/10.1109/IPDPS53621.2022.00081

Liu, Xu T.; Firoz, Jesun; Aksoy, Sinan; Amburg, Ilya; Lumsdaine, Andrew; Joslyn, Cliff; Praggastis, Brenda; Gebremedhin, Assefaw H. (May 2022, 2022 IEEE International Parallel and Distributed Processing Symposium (IPDPS))

Full Text Available
Parallel Algorithms for Efficient Computation of High-Order Line Graphs of Hypergraphs

https://doi.org/10.1109/HiPC53243.2021.00045

Liu, Xu T.; Firoz, Jesun; Lumsdaine, Andrew; Joslyn, Cliff; Aksoy, Sinan; Praggastis, Brenda; Gebremedhin, Assefaw H. (December 2021, 2021 IEEE 28th International Conference on High Performance Computing, Data, and Analytics (HiPC))

Full Text Available
